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The small envelope protein (E) plays a role of central importance in the assembly of coronaviruses. This was 
initially established by studies demonstrating that cellular expression of only E protein and the membrane 
protein (M) was necessary and sufficient for the generation and release of virus-like particles. To investigate 
the role of E protein in the whole virus, we previously generated E gene mutants of mouse hepatitis virus 
(MHV) that were defective in viral growth and produced aberrantly assembled virions. Surprisingly, however, 
we were also able to isolate a viable MHV mutant (AE) in which the entire E gene, as well as the nonessential 
upstream genes 4 and 5a, were deleted. We have now constructed an E knockout mutant that confirms that the 
highly defective phenotype of the AE mutant is due to loss of the E gene. Additionally, we have created 
substitution mutants in which the MHV E gene was replaced by heterologous E genes from viruses spanning 
all three groups of the coronavirus family. Group 2 and 3 E proteins were readily exchangeable for that of 
MHV. However, the E protein of a group 1 coronavirus, transmissible gastroenteritis virus, became functional 
in MHV only after acquisition of particular mutations. Our results show that proteins encompassing a 
remarkably diverse range of primary amino acid sequences can provide E protein function in MHV. These 
findings suggest that E protein facilitates viral assembly in a manner that does not require E protein to make 
sequence-specific contacts with M protein. 


Coronaviruses are a family of enveloped, positive-sense 
RNA viruses that infect numerous mammalian and avian spe¬ 
cies and have gained recent widespread attention due to the 
emergence of severe acute respiratory syndrome (SARS) (37, 
48). Virion assembly of coronaviruses is the culmination of a 
series of interactions among a minimal set of four structural 
proteins and the viral genome at the site of budding in the 
endoplasmic reticulum-Golgi intermediate compartment (12). 
Three of the structural proteins are membrane bound and 
become incorporated into the virion envelope. These are the 
spike protein (S), which initiates infection through attachment 
to host cell receptors and fusion with host membranes; the 
membrane protein (M), the major constituent of the envelope; 
and the small envelope protein (E). In the interior of the 
virion, the fourth component, the nucleocapsid protein (N), 
forms a helical nucleocapsid with the RNA genome. 

Much of our knowledge about coronavirus assembly has 
come from studies of virus-like particles (VLPs) produced by 
coexpression of virion structural proteins (12). The earliest 
investigations employing such systems demonstrated that co¬ 
expression of mouse hepatitis virus (MHV) M and E proteins 
was necessary and sufficient for VLP formation and release 
from cells (3, 47). Prior to this discovery, the significance of E 
protein had not been realized. It was subsequently shown that 
multiple other coronaviruses—bovine coronavirus (BCoV) (2), 
transmissible gastroenteritis virus (TGEV) (2), infectious 
bronchitis virus (IBV) (4, 6), and SARS coronavirus (SARS- 
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CoV) (39)—conformed to the same rule. The only apparent 
exception to this pattern has been a report that the M and N 
proteins were necessary and sufficient for SARS-CoV VLP 
formation (23). M protein is a 25-kDa protein containing three 
transmembrane segments, with a short amino-terminal ectodo- 
main and a large carboxy-terminal endodomain. M is the most 
abundant viral structural protein, and envelope formation is 
thought to largely be driven by M-M monomer interactions. 
The role of E protein is much less clear. 

The E protein is a small polypeptide, ranging from 76 to 109 
amino acids (8.4 to 12 kDa), and is a minor constituent of 
virions. E proteins all have large predicted hydrophobic do¬ 
mains, and it has been shown for MHV (47), IBV (4), and 
SARS-CoV (31) that E is an integral membrane protein, al¬ 
though it does not contain a cleavable signal sequence (43). 
The E proteins of IBV (5) and SARS-CoV (31) are palmitoy- 
lated on one or more cysteine residues, but it is currently 
unclear whether the TGEV or MHV E proteins share this 
modification (17, 43, 54). Also unresolved is the membrane 
topology of E. An early report suggested a C-exo, N-endo 
membrane orientation for the TGEV E protein (17). Contrary 
to this, the IBV E protein was observed to adopt an N-exo, 
C-endo orientation (4, 55). Different still, evidence obtained 
with the MHV E protein is consistent with an N-endo, C-endo 
topology (36, 43), and such a hairpin transmembrane configu¬ 
ration has also been proposed for the SARS-CoV E protein (1, 
26, 55). It is thus possible that the E proteins of different 
coronavirus species do not have a uniform membrane topology 
or that the orientation of E varies with the protein’s level of 
expression or oligomerization. 

Our laboratory has been using genetic methods to study the 
MHV E protein in the intact virus. In previous work we found 
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that certain charged-to-alanine mutations constructed in the E 
gene resulted in defective virus growth and in thermolability. 
The assembled virions of one such mutant had striking de¬ 
fects—elongated and pinched shapes—that were seldom seen 
among wild-type virions (14). This phenotype was clearly con¬ 
sistent with the critical role for E protein in virion assembly 
indicated by the VLP studies. It was therefore surprising that 
we were later able to generate a viable, albeit extremely de¬ 
fective, MHV recombinant (AE) in which the E gene, as well 
as the accessory genes 4 and 5a, were entirely deleted (30). 
This confirmed that although the E protein is important for 
MHV assembly, it is not absolutely essential. By contrast, for 
TGEV it was shown by two independent reverse genetic ap¬ 
proaches that if the E gene was knocked out, then viable virus 
could be recovered only if E protein was provided in trans 
(8, 41). 

In the present study, we investigated whether E proteins 
from different coronavirus species could functionally replace 
that of MHV. We found that heterologous E proteins with 
remarkably divergent amino acid sequences were able to sub¬ 
stitute for MHV E. Our results suggest that most coronavirus 
E proteins play similar roles in virion assembly, without the 
sequence-specific constraints that would be expected if E pro¬ 
tein were required to directly interact with M protein. 

MATERIALS AND METHODS 

Cells, viruses, and antibodies. The MHV strain A59 wild type and mutants 
were grown in mouse 17 clone 1 (17C11) or L2 cells; plaque assays and plaque 
purifications were carried out with L2 cells. The feline-murine interspecies chi¬ 
meric virus fMHV.v2 (18) was grown in feline FCWF cells. 

For the generation of antibodies specific for the MHV E protein, the carboxy- 
terminal tail of E (amino acids 38 through 83) was connected to an amino- 
terminal hexahistidine tag in the vector pET-28a(+) (Novagen). The bacterially 
expressed fusion protein was isolated by Ni-nitrilotriacetic acid resin affinity 
purification (Novagen) and sodium dodecyl sulfate-polyacrylamide gel electro¬ 
phoresis (SDS-PAGE) and was used to immunize rabbits. To test antiserum 
specificity, the MHV E and M genes were inserted into the Semliki Forest virus 
(SFV) replicon vector pSFVl (32) for protein expression in L2 cells. Rabbit 
antipeptide antibodies specific for the IBV E protein (amino acids 96 through 
109; PANFQDVQRDKLYS) were provided by Carolyn Machamer (Johns Hop¬ 
kins University School of Medicine) (4). Rabbit antipeptide antibodies directed 
against the TGEV E protein (amino acids 61 through 82; AYDAYKNFMRIK 
AYNPDGALLA) were obtained from Washington Biotechnology, Inc. Mono¬ 
clonal antibodies J.3.3 and J.E3 (15), specific for the MHV N and M proteins, 
respectively, were provided by John Fleming (University of Wisconsin, Madison). 

Plasmid constructs. Transcription templates for viral mutant construction 
were derived from pMH54 (28), which contains cDNA corresponding to the 
5'-most 0.5 kb of the MHV genome fused to the 3'-most 8.6 kb of the MHV 
genome (see Fig. 1A). All mutations and substitutions were generated via splic¬ 
ing overlap extension-PCR (22) and were inserted into pMH54 by exchange of 
the segment between the Sbfl site following the S gene and either the EcoRV site 
at the end of the E gene or the EagI site near the start of the M gene. In pLKlOl, 
the open reading frame (ORF) for the E protein was knocked out with nine point 
mutations that disrupted the start codon and created stop codons in all three 
reading frames immediately downstream. 

Heterologous E ORFs were substituted in pMH54 by an exact exchange of 
start codons, at their upstream ends, and in such a manner at their downstream 
ends as to preserve the context of the transcription-regulating sequence (TRS6) 
for the M gene. PCR templates for the substitution mutants were cDNA clones 
obtained from David Brian (University of Tennessee) for the BCoV and TGEV 
E genes and from Carolyn Machamer (Johns Hopkins University School of 
Medicine) for the IBV E gene. The SARS-CoV E gene was amplified from a 
cDNA clone made from infected cell RNA provided by Jill Taylor (Wadsworth 
Center). The MHV-A59 E ORF sequence in this study was identical to that 
found under GenBank accession number AY700211. The BCoV strain Mebus E 
ORF was the same as that under GenBank accession number U00735, except 
that we altered the last residue (V84I) in plasmid pLK99 in order to retain a 


nucleotide sequence context upstream of TRS6 identical to that in the MHV 
genome (see Fig. 1A). In addition, synonymous changes were made in the 
penultimate codon (GAC to GAU) and the stop codon (UAG to UAA). The 
SARS-CoV strain Urbani E ORF was the same as that under GenBank accession 
number AY278741, except for a single residue difference (L37H), which corre¬ 
sponds to the consensus sequence of our laboratory stock of SARS-CoV Urbani. 
The TGEV strain Purdue E ORF was identical to that under GenBank accession 
number AJ271965. The IBV strain M42 E ORF was the same as that found 
under GenBank accession number AY692454, except for a single residue differ¬ 
ence (L73I). For the substituted SARS-CoV, TGEV, and IBV E genes in 
pLK105, pLK107, and pLK103, respectively, the E ORF was placed upstream of 
the EcoRV site in order to preserve 10 nucleotides (nt) of MHV genomic 
sequence upstream of TRS6 (see Fig. 1A). 

Oligonucleotides for PCR and DNA sequencing were obtained from Inte¬ 
grated DNA Technologies. For every constructed vector, overall plasmid com¬ 
position was monitored by restriction analysis, and the DNA sequences of all 
ligation junctions and all regions created by PCR amplification were verified by 
automated sequencing. 

Mutant construction by targeted RNA recombination. Mutants of MHV con¬ 
taining E gene modifications were isolated by targeted RNA recombination, 
using a host range-based selection described previously (see Fig. IB) (27, 28, 38). 
In short, monolayers of feline FCWF cells were infected with the interspecies 
chimeric coronavirus fMHV.v2, which is an MHV recombinant containing the 
ectodomain of the S protein of feline infectious peritonitis virus (FIPV) (18). 
Infected cells were then transfected with capped synthetic donor RNA by elec¬ 
troporation (Gene Pulser II; Bio-Rad). Donor RNAs were synthesized by in vitro 
transcription with T7 RNA polymerase (mMessage mMachine; Ambion) using 
Pacl-truncated plasmid templates. The infected and transfected feline cells were 
plated into 10-cm 2 wells, and released progeny virus was harvested at 30 h 
postinfection at 37°C. Recombinant candidates were selected and purified by two 
rounds of plaque titration on mouse L2 cell monolayers at 37°C. 

For analysis of recombinant candidates, total RNA was extracted from in¬ 
fected 17C11 cell monolayers with the Ultraspec reagent (Biotecx), and reverse 
transcription (RT) of RNA was carried out with a random hexanucleotide primer 
(Roche) and avian myeloblastosis virus reverse transcriptase (Life Sciences). To 
confirm the presence of incorporated mutations or genes, PCR amplifications of 
cDNAs were performed with AmpliTaq polymerase (Roche) using primer pairs 
flanking the E-gene region. RT-PCR products were analyzed directly by agarose 
gel electrophoresis and were purified with Quantum-prep columns (Bio-Rad) 
prior to automated sequencing. 

TGEV E protein gain-of-function mutants were isolated as intermediate- to 
large-plaque variants that arose following two to four passages of independent 
plaques of the original TGEV E-gene substitution mutant. The loci of new 
mutations were determined by sequencing of RT-PCR products spanning the 
entire M and E genes of each mutant. A number of candidate gain-of-function 
mutations were reconstructed by splicing overlap extension-RT-PCR amplifica¬ 
tion of E genes from infected cell RNA, followed by cloning of the resulting 
product between the Sbfl and EagI sites of pMH54. Substituted TGEV E genes 
containing candidate gain-of-function mutations were then transferred to MHV 
via targeted RNA recombination. 

Virus purification. Wild-type and IBV E substitution mutant viruses were 
purified exactly as described previously (52) by polyethylene glycol precipitation 
followed by two cycles of equilibrium centrifugation on preformed gradients of 0 
to 50% potassium tartrate osmotically counterbalanced with 30 to 0% glycerol, 
in a buffer of 50 mM Tris-maleate (pH 6.5) and 1 mM EDTA. For further 
purification, a modification of a previously reported sucrose flotation procedure 
was used (28, 47). Virus in 0.5 ml of phosphate-buffered saline (PBS) was mixed 
with 1.25 ml of 70% sucrose in TM buffer (20 mM Tris-HCl, 20 mM MgCl 2 , pH 
7.2) to give a final concentration of 50% sucrose. Each virus-sucrose mixture was 
transferred to the bottom of an SW60 centrifuge tube, and above it layers of 1 ml 
of 48% sucrose in TM, 0.5 ml of 40% sucrose in TM, and 0.5 ml 30% sucrose in 
TM were added. Samples were centrifuged at 133,000 X g in a Beckman SW60 
rotor at 4°C for 46 h. The top 1 ml of each tube was collected and diluted with 
PBS, and virus was pelleted by centrifugation at 151,000 X g in a Beckman SW41 
rotor at 4°C for 3 h; the virus pellet was then dissolved in SDS-PAGE sample 
buffer. 

Western blotting. To generate a sufficiently high-titer stock of the TGEV 
E-gene substitution mutant for subsequent infections for Western blot analysis, 
we grew this mutant and a subset of reconstructed TGEV E gain-of-function 
mutants in 17C11 cells expressing the MHV E protein. This cell line was gener¬ 
ated using the plasmid pTetON-IRES-MHV-E, in which the MHV E gene was 
linked to the encephalomyocarditis virus internal ribosome entry site (IRES) and 
inserted into the vector pTet-On (Clontech). pTetON-IRES-MHV-E, which also 
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contains the neomycin/kanamycin resistance gene, was provided by Volker Thiel 
(Cantonal Hospital, St. Gallen, Switzerland). Cells constitutively expressing 
the MHV E protein were selected on the basis of resistance to Geneticin 
(Invitrogen). 

For the preparation of cell lysates, confluent 25-cm 2 monolayers of regular 
(not E-expressing) 17C11 cells were mock infected or were infected with wild- 
type MHV or one of the constructed mutants. Cells were then incubated at 37°C 
for 12 to 14 h. Monolayers were washed twice with PBS and then lysed by 
addition of 600 pi of 50 mM Tris-HCl, pH 8.0, 150 mM NaCl, 1.0% Nonidet 
P-40, 0.7 |xg/ml pepstatin, 1.0 |xg/ml leupeptin, 1.0 |xg/ml aprotinin, and 0.5 
mg/ml Pefabloc SC (Roche). Lysates were held for 5 to 15 min on ice and were 
then clarified by centrifugation. Samples of infected-cell lysates were separated 
by SDS-PAGE through 15% or 17.5% polyacrylamide and were transferred to 
polyvinylidine difluoride membranes. Blots were then probed with various anti¬ 
bodies, and bound primary antibodies were visualized using a chemilumines¬ 
cence detection system (ECL; Amersham). 

RESULTS 

Expression of E protein is critical for MHV replication. In a 

prior study, we reported the generation of a viable MHV 
recombinant, AE, in which the E gene, together with accessory 
genes 4 and 5a, was deleted (30). Our isolation of the AE 
mutant demonstrated that the E protein is not absolutely es¬ 
sential for MHV replication, as had been previously assumed. 
However, the AE mutant grew very poorly, reaching maximal 
titers that were at least three orders of magnitude lower than 
those of wild-type MHV, and the AE virus produced very tiny 
plaques when plated onto mouse L2 cell monolayers (30). We 
concluded that this extremely defective phenotype was the 
result of the E-gene deletion, since earlier work by a number of 
laboratories had shown that mutation, deletion, or replace¬ 
ment of genes 4 and 5 a had no significant effect on viral 
phenotype in tissue culture (11,13, 40, 49, 53). Nevertheless, it 
could not be entirely ruled out that the characteristics of the 
AE mutant were partially due to the size of the genomic de¬ 
letion, the concomitant loss of one or both of the accessory 
genes, or the new context that had been created upstream of 
TRS6, which governs transcription of the M gene. 

To ascertain whether the highly defective phenotype ob¬ 
served for the AE mutant did indeed result from ablation of 
the E gene, we generated a recombinant virus in which E 
protein expression was specifically abolished. For this purpose, 
we designed an E gene knockout plasmid, pLKlOl, in which 9 
point mutations within the first 17 bases of the E ORF changed 
the start codon from AUG to ACG and instituted stop codons 
in all 3 reading frames immediately downstream (Fig. 1A). The 
remainder of the E gene, as well as genes 4 and 5a, was kept 
intact, except that an additional amino acid was appended to 
ORF 5a as a result of one of the introduced E ORF stop 
codons. These mutations were transferred to the MHV ge¬ 
nome by the technique of targeted RNA recombination, using 
host-range-based selection with the chimeric virus fMHV.v2 
(Fig. IB) (18, 28, 38), and the presence of all intended nucle¬ 
otide changes was confirmed by direct sequencing of RT-PCR 
products. The phenotype of the resulting E knockout mutant, 
E-KO, was identical to that of AE. When plated onto mouse L2 
cells, the E-KO mutant produced plaques that were indistin¬ 
guishable from those of the AE mutant and were tiny by com¬ 
parison to plaques of the wild type (Fig. 2A). Moreover, the 
E-KO mutant grew very slowly in murine 17C11 cells, with 
exceptionally low peak titers compared to those of the wild 
type. The wild-type MHV used throughout this study was 


Alb240, a well-characterized isogenic recombinant that had 
been previously reconstructed from fMHV and pMH54 donor 
RNA (30). 

To examine E protein expression in wild-type, AE, and 
E-KO viruses, we generated antiserum specific for the E pro¬ 
tein of our laboratory strain of MHV-A59. We (30) and others 
(11) have previously found that antisera raised against the E 
proteins of other laboratory strains of MHV (43, 54) fail to 
recognize our E protein in Western blots, owing to sequence 
differences of as few as two amino acids. We therefore bacte- 
rially expressed and purified an amino-terminal hexahistidine- 
tagged fragment of the MHV E protein (E-tail), corresponding 
to amino acids 38 through 83, and this polypeptide was used to 
immunize rabbits. The specificity of the collected antiserum 
was demonstrated by Western blotting with samples from a 
number of sources. Anti-MHV E antiserum was found to react 
with bacterially expressed full-length, hexahistidine-tagged E 
protein (10.7 kDa) and with the original E-tail immunogen (6.2 
kDa) (Fig. 2B). The antiserum also specifically reacted with E 
protein that was expressed from an SFV RNA expression vec¬ 
tor in mouse L2 cells (9.7 kDa) but not with lysates from 
mock-transfected cells or cells transfected with empty SFV 
vector or SFV expressing MHV M protein. Similarly, the an¬ 
tiserum reacted with a 9.7-kDa band in wild-type MHV-in- 
fected 17C11 cells but not with lysates from mock-infected or 
AE-infected cells (Fig. 2B). 

Further Western blot analyses showed that no E protein was 
expressed by either the E-KO mutant or the AE mutant. As 
shown in Fig. 2C, there was no detectable signal corresponding 
to E protein in lysates of L2 cells infected with two indepen¬ 
dent isolates each of E-KO or AE, whereas wild-type-infected 
controls exhibited a strong band for E protein. Both E-KO- 
infected and AE-infected lysates exhibited the same nonspe¬ 
cific background of higher-molecular-weight bands as was seen 
in wild-type-infected and mock-infected controls. Prolonged 
exposure did not reveal a detectable specific E protein signal in 
either of the two mutants. Moreover, the lack of E expression 
in either E-KO-infected or AE-infected cells was unlikely to 
have resulted from their slow virus growth relative to that of 
the wild type, since all infected-cell lysates were harvested at 
similar stages of infection, when nearly 90% syncytium forma¬ 
tion was observed. Consistent with this, a duplicate blot of the 
same samples probed with anti-N monoclonal antibody, as an 
internal control for the extent of infection, showed similar 
levels of N expression in all infected-cell lysates (Fig. 2C). It 
should be noted that since the anti-MHV E antiserum was 
raised against the carboxy terminus of E protein, we would also 
have been able to detect an amino-terminally truncated E 
protein fragment initiated from a downstream methionine at 
codon 25 in the E-KO mutant if such a species had been 
synthesized. Our results with the E-KO mutant thus confirmed 
our previous conclusion that the defects observed in the AE 
mutant directly resulted from the absence of E protein expres¬ 
sion. The combined findings from both E mutants established 
that E protein expression, while not essential in MHV, is crit¬ 
ical for the virus to carry out a productive infection. 

Heterologous E genes substituted from other coronaviruses 
have various levels of functionality in MHV. The dramatic 
impairment of the AE and E-KO mutants provided a basis for 
us to examine the constraints on E protein sequence and struc- 
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pMH54 .. .CGCAGCTCGAAAGAAATGTTTAATTTATTCCTTACAGACACAGTATGG... 

(ORF 5a) . . .Q L E R N V * 

M F N L F L T D T V W. . . (E ORF) 

pLKIOI .. .CGCAGCTCGAAAGAAAcGTaaAATaatTTaaaTACAGACACAGTATGG... 
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. . . AAAGAAATGTTTAATTTATTC.CTATTAGAGGTGGATGATATCTAATCCAAACATTATGAGTAGTACT . . . 

M F N L F. . . (MHV E ORF) ....LLEVDDI* M S S T. . . (M ORF) 
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. . . AAAGAAATGTACTCATTCGTT.CTTCTGGTCTAAGATGATATCTAATCCAAACATTATGAGTAGTACT . . . 

M Y S F V. (SARS-CoV E ORF) . L L V * M S S T. . . (M ORF) 

pl_K107 . . 

. . .AAAGAAATGACGTTTCCTAGG. 


.CTCCTTGCTTGAGATGATATCTAATCCAAACATTATGAGTAGTACT. . . 

M T F P R... (TGEV E ORF) ...L L A * M S S T...(M ORF) 
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. . . AAAGAAATGATGAATTTATTG.TTGTACTCTTGAGATGATATCTAATCCAAACATTATGAGTAGTACT . . . 

M M N L L...( IBV E ORF) ....L Y S * MSST...(M ORF) 
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FIG. 1. Construction of E protein knockout and substitution mutants of MHV. (A) Transcription vectors used for the synthesis of donor RNA 
were derived from pMH54 (28), which contains a 5' segment of the MHV genome (denoted by [1]) fused to a partial hemagglutinin (HE) gene 
and all genes downstream of HE. The T7 RNA polymerase start site is denoted by an arrow; the locations of restriction sites relevant to plasmid 
construction (Sbfl, EcoRV, and EagI) or template linearization (PacI) are indicated. Shown above the pMH54 schematic are mutations 
(lowercase) incorporated into pLKIOI to create the E knockout mutant. Changes in the first 17 bases of the E ORF eliminated the start codon 
and generated stop codons in all three reading frames (underlined). Shown beneath the pMH54 schematic are the sequences of the junctions of 
heterologous E genes substituted for the MHV E gene in transcription vectors pLK99 (BCoV E gene), pLK105 (SARS-CoV E gene), pLK107 
(TGEV E gene), and pLK103 (IBV E gene). The 5' end of every substituted E ORF was a direct AUG-for-AUG exchange. At the 3' end of every 
substitution, the last 12 bases of the MHV E ORF were left intact in order to retain the upstream context of TRS6, as detailed in Materials and 
Methods. TRS6 and the EcoRV site are indicated with solid and broken lines, respectively. (B) Method of selection of E protein mutants by 
targeted RNA recombination between the interspecies chimeric virus fMHV.v2 (18) and donor RNAs transcribed from transcription vectors shown 
in panel A. The fMHV.v2 genome encodes the ectodomain region of the FIPV S gene (shaded rectangle) in place of that of the MHV S gene; 
consequently, fMHV.v2 grows in feline cells but not in murine cells. A single crossover event within the HE gene generates a recombinant that 
has simultaneously reacquired the MHV S ectodomain and the mutated E gene (solid rectangle). The rearranged order of genes following the S 
gene in fMHV.v2 rules out the occurrence of undesired secondary crossovers downstream of the S gene (18). Recombinants are selected as progeny 
that have reacquired the ability to grow in murine cells. 


ture by determining whether E genes from other coronaviruses 
can functionally replace that of MHV. Coronaviruses are clas¬ 
sified into three phylogenetic groups (37), which are likely to 
be accorded the status of genera (19); MHV is the prototype 


member of group 2. E protein sequences diverge widely among 
the three coronavirus groups. In pairwise amino acid sequence 
comparisons, the MHV E protein ranges from a maximum of 
65% identity with its closest group 2 relative, the BCoV E 
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FIG. 2. Phenotype of the E-KO mutant. (A) Plaques of the E-KO mutant (Alb517) compared with those of the AE mutant (Alb291) (30) and 
an isogenic wild-type control (Alb240) (30) that was reconstructed with pMH54 donor RNA. Plaque titrations were carried out with L2 cells at 
37°C. Monolayers were stained with neutral red at 72 h postinfection and were photographed 18 h later. Mock-infected cells are shown at the left. 
(B) Specificity of rabbit polyclonal antiserum raised against the carboxy-terminal tail of the MHV E protein (amino acids 38 through 83). 
Antibodies were used to probe a Western blot of the following: lanes 1 to 4, lysates of mock-transfected L2 cells or L2 cells transfected with empty 
SFV vector, SFV vector encoding MHV M protein, or SFV vector encoding MHV E protein; lanes 5 to 7, lysates of mock-infected, wild-type 
MHV-infected, or AE-infected 17C11 cells; lanes 8 and 9, lysates of E. coli expressing the carboxy-terminal tail of E or full-length E. (C) Western 
blots of lysates from L2 cells that were either mock infected or infected with wild-type MHV (Alb240) or with two independent isolates each of 
the AE mutant (Alb290 and Alb291) or the E-KO mutant (Alb517 and Alb518). Blots were probed with polyclonal anti-MHV E antiserum or with 
monoclonal anti-N antibody J.3.3 (15). 


protein, to a mere 22% identity with the E protein of the group 
3 IBV E protein (Fig. 3A). Similarly, pairwise sequence iden¬ 
tities among a set of coronavirus E proteins, other than that of 
MHV, fall between 28% and 16% (Fig. 3A). A more compre¬ 
hensive analysis of pairwise sequence identities found that the 
E protein has the highest intergroup divergence of the four 
canonical coronavirus structural proteins (19). A multiple 
alignment of a set of E proteins from all three coronavirus 
groups shows only four completely conserved residues and 
considerable heterogeneity in the size of this small protein 
(Fig. 3B). Despite such extensive variability, common charac¬ 
teristics can be discerned for the different E proteins. Each has 
a short hydrophilic amino terminus followed by a relatively 
large transmembrane domain. The remainder of the polypep¬ 
tide is a carboxy-terminal tail, constituting one-half or more of 
the molecule. Two to four cysteine residues are found within 
the transmembrane domain or in the membrane-proximal por¬ 
tion of the carboxy-terminal tail. As yet, however, the exact 
boundaries of the transmembrane domain have not been well 
defined for any E protein. Moreover, membrane protein to¬ 
pology prediction programs other than the one that we have 
used (7) position the transmembrane domains somewhat dif¬ 


ferently than is shown in Fig. 3B, and in many cases, a second 
transmembrane segment is predicted within the carboxy-ter¬ 
minal tail. 

To substitute E genes from other coronaviruses, we in each 
case swapped the MHV E ORF with a heterologous E ORF by 
making an exact exchange of start codons and by making an 
exact, or nearly exact, exchange of the stop codon position, as 
shown in Fig. 1A and described in detail in Materials and 
Methods. Because the MHV E ORF is not preceded by its own 
TRS, it consequently follows ORF 5a in subgenomic mRNA5. 
The translation of the MHV E protein has been proposed to be 
governed by an IRES that occupies part of the immediately 
upstream gene 5a (25, 45). Since the IRES could conceivably 
affect both the level and timing of E protein expression, we 
made an AUG-for-AUG replacement at the upstream end of 
each substitution in order to preserve the E ORF genomic 
context as closely as possible (Fig. 1A). Similarly, at the down¬ 
stream end of each substitution, we made the ORF replace¬ 
ment such that the immediately upstream context of TRS6 was 
retained in order to avoid unintended effects on the expression 
of the downstream M gene. In the case of the BCoV E ORF, 
the latter criterion allowed us to position the stop codon of the 
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FIG. 3. Replacement of the MHV E protein with heterologous coronavirus E proteins. (A) Percent amino acid identities for pairwise alignments 
between representative members of all three coronavirus groups. Pairwise alignments were generated with the gap program of the Genetics Computer 
Group sequence analysis package (16). (B) Multiple alignment of the coronavirus E proteins used in this study. The alignment was produced manually 
from comparison of the pairwise alignments. Residues that are common to three, four, or all five sequences are marked by open, gray, or black circles, 
respectively. The solid bar represents the limits of the transmembrane domains of each of the sequences, as assigned by the dense alignment surface 
method (7) (http://www.sbc.su.se/~miklos/DAS/); the thicker section of the bar indicates positions predicted to be transmembrane residues in all five E 
proteins. (C) Plaques of heterologous E gene substitution mutants, compared with plaques of wild-type MHV and the AE mutant. Plaque titrations were 
carried out on L2 cells at 37°C. Monolayers were stained with neutral red at 72 h postinfection and were photographed f 8 h later. 


heterologous ORF at the same nucleotides as those of its 
MHV counterpart. This positioning was done at the expense of 
altering the last residue of the BCoV E protein from valine to 
isoleucine, a conservative change that turned out to have no 


effect. For the SARS-CoV, TGEV, and IBV E ORFs, the stop 
codon was moved 12 nucleotides upstream of the position of its 
MHV counterpart, thereby leaving unchanged the identity of 
the 10 nucleotides preceding TRS6 (Fig. 1A). 
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Recombinants containing heterologous coronavirus E genes 
were generated by targeted RNA recombination. Multiple re¬ 
combinant viruses were obtained for each E replacement, and 
two independent isolates of each substitution mutant were 
purified, confirmed by sequencing of the E gene and its bound¬ 
aries, and used in further analyses. We first examined the 
plaque size phenotype of each chimeric virus. As expected, the 
recombinant containing the highly homologous BCoV E gene 
in place of the MHV E gene was indistinguishable from wild- 
type MHV. It produced wild-type-sized plaques on mouse L2 
cells (Fig. 3C) and grew as efficiently as and to similar titers as 
wild-type MHV. Similarly, the E gene derived from SARS- 
CoV was almost completely able to functionally replace its 
MHV counterpart. The recombinant containing the SARS- 
CoV E gene produced plaques nearly equivalent in size to 
those of the wild type. This finding is in line with the classifi¬ 
cation of SARS-CoV as an outlying member of the group 2 
coronaviruses, more closely related to MHV and BCoV than 
to other members of the family (21). It must be noted that the 
SARS-CoV E gene used in our study differs from the reported 
SARS-CoV strain Urbani sequence (44) by a single transver¬ 
sion that results in a change of codon 37 from CUU (leucine) 
to CAU (histidine). However, since the sequence containing 
this change is the only sequence detected in our laboratory 
stock of SARS-CoV, it must represent a functional E gene for 
that virus. We therefore do not think that the L37H change 
accounts for the slightly diminished plaque size in the SARS- 
CoV E substitution mutant. 

In contrast to the intragroup E gene substitutions, the E 
gene from the group 1 coronavirus TGEV was not able to 
functionally replace the MHV E gene. The recombinant con¬ 
taining the TGEV E gene formed very tiny plaques that were 
not measurably different from those of the AE mutant (Fig. 
3C). One explanation for this result might have been that the 
transplanted TGEV E gene was not expressed in the MHV 
genome, but this possibility was later ruled out (see below). 
Therefore, unlike the BCoV or SARS-CoV E protein, the 
TGEV E protein was inert with respect to facilitation of MHV 
virion assembly. This outcome, the incompatibility of group 1 
and group 2 structural proteins in virion morphogenesis, ap¬ 
pears to echo previous results in which VLPs could not be 
formed by coexpression of BCoV M protein and TGEV E 
protein or, conversely, by coexpression of TGEV M protein 
and BCoV E protein (2). 

The most surprising result with the substitution mutants was 
that the E gene from IBV, a group 3 coronavirus, was com¬ 
pletely functional in MHV. This finding was unexpected, since 
the IBV E protein has the lowest degree of sequence homology 
with the MHV E protein and has a size difference of 26 amino 
acids (Fig. 3A and B). The chimeric MHV mutant containing 
the IBV E gene formed wild-type-sized plaques on L2 cells 
(Fig. 3C), and it grew efficiently in 1701 cells, to high titers 
equivalent to those of wild-type MHV (data not shown). Viral 
protein expression in the IBV E substitution mutant was as¬ 
sessed by Western blot analysis comparing lysates from mu¬ 
tant-infected cells with those from wild-type-infected and 
mock-infected controls (Fig. 4). This confirmed that IBV E 
protein was expressed in two independent isolates of the sub¬ 
stitution mutant, whereas no trace of MHV E was detected in 
the same mutants on a duplicate blot probed with anti-MHV E 
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FIG. 4. Western blots of lysates from 17C11 cells that were mock 
infected or were infected with wild-type MHV (Alb240) or with two 
independent isolates of the IBV E substitution mutant (Alb499 and 
Alb500). Blots were probed with polyclonal anti-IBV E antiserum (4), 
polyclonal anti-MHV E antiserum, monoclonal anti-N antibody J.3.3 
(15), or monoclonal anti-M antibody J.1.3 (15). 


antiserum. The latter result ruled out the possibility that a 
functional copy of the MHV E gene was somehow retained 
elsewhere in the substitution mutants. Additionally, the expres¬ 
sion of two other MHV structural proteins, N and M, showed 
little or no alteration as a consequence of the IBV E gene 
substitution; the same lysates probed with monoclonal antibod¬ 
ies against MHV N or M gave rise to nearly equivalent signals 
for wild-type MHV and the substitution mutant (Fig. 4). These 
data indicate that the IBV E protein can substitute for the 
MHV E protein in MHV replication. 

To determine whether IBV E protein was incorporated into 
virions of the substitution mutant, we purified viruses released 
from infected 17C11 cells by two cycles of equilibrium sedimen¬ 
tation on tartrate-glycerol gradients (52). Western blot analysis 
of purified virions revealed that IBV E protein was incorpo¬ 
rated into virions that were otherwise entirely composed of 
MHV components (Fig. 5A). Because the IBV and MHV E 
proteins were detected with different antisera with different 
affinities for their respective antigens, we could not determine 
whether equivalent molar amounts of the two E proteins were 
incorporated into virions. However, the quantity of each E pro¬ 
tein incorporated into virions was a similar fraction of the total E 
protein expressed in infected cells. To further test the stringency 
of the virus preparation, gradient-isolated virions were addition¬ 
ally purified by a sucrose flotation procedure (28, 47). This con¬ 
firmed that IBV E protein was a component of virions of the 
substitution mutant (Fig. 5B). 

Specific mutations in either the transmembrane domain or 
the carboxy-terminal tail of the TGEV E protein render it 
functional in MHV. The phenotype of the TGEV E substitu¬ 
tion mutant, which was essentially the same as that of the AE 
mutant, presented us with the opportunity to search for muta- 
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FIG. 5. Incorporation of IBV E protein into virions. (A) Wild-type MHV (Alb240) and an IBV E substitution mutant (Alb499) were grown 
in 17C11 cells and purified by two cycles of equilibrium sedimentation on tartrate-glycerol gradients, as described in Materials and Methods. 
(B) Portions of isolated virus were further purified by sucrose flotation. Virions and cell lysate controls were analyzed by Western blots probed 
with polyclonal anti-IBV E antiserum (4), polyclonal anti-MHV E antiserum, monoclonal anti-N antibody J.3.3 (15), or monoclonal anti-M 
antibody J.1.3 (15). 


tions that might facilitate better growth of this virus. To ac¬ 
complish this, we started cultures from multiple individual 
plaques of the TGEV E substitution mutant and serially pas¬ 
saged each of them two to four times in L2 cells, frequently 
monitoring the spread of infection in the monolayer. When 
accelerated growth was detected for a given passage, viral 
supernatants were titered on L2 cells. From this procedure we 
identified and purified plaques that were markedly larger than 
those of the original mutant, although none had fully attained 
the size of plaques of wild-type MHV. A total of 30 viruses with 
enhanced growth, representing 26 independent mutants of the 
original TGEV E substitution recombinant, were isolated in 
this fashion through two separate searches. 

To map genetic changes in each of these TGEV E gain-of- 
function mutants, we sequenced RT-PCR products that 
spanned the entirety of the E and the M genes. The results of 
this analysis are summarized in Fig. 6. For each mutant, we 
found at least one base change in the TGEV E gene. By 
contrast, no changes were seen in the M gene of any mutant. 
All identified mutations resulted in coding changes in the 
TGEV E ORF; there were no silent nucleotide changes. Most 
mutations were found in more than one mutant, which suggests 
that our searches were saturating. With two exceptions, all 
changes were point mutations, and in almost all cases, these 


occurred as a single point mutation per mutant. The paucity of 
multiple mutations in members of this set was likely a reflec¬ 
tion of our approach, in which we searched for large-plaque 
variants at the earliest passage in which an upward shift in viral 
growth rate was observed. Most strikingly, the mutations were 
found to cluster almost exclusively in two small regions of the 
TGEV E protein, one in the transmembrane domain, between 
residues 25 and 32, and the other within the carboxy-terminal 
tail, between residues 50 and 60 (Fig. 6). 

Two of the TGEV E mutants harbored changes that were 
not point mutations. Mutant TE29 contained a 15-nt insertion 
that resulted in the replacement of 155 with six amino acids, 
MSLMKF. It is notable that this alteration also occurred in 
one of the two regions where point mutations were found to 
cluster. Curiously, the insertion in mutant TE29, 5'GUCGUU 
GAUGAAGUU3', is identical to nucleotides 17,472 to 17,486 
of the MHV genome, which are within the nspl3 segment of 
the pplab ORF. This suggests that the insertion arose from a 
nonhomologous recombination event. A second mutant, TE30, 
contained a 103-nt deletion that fused a thereby truncated 
ORF 5a to codon 15 of the TGEV E ORF. We have not yet 
determined whether the potential 5a-E fusion protein is ex¬ 
pressed. The mutations in TE29 and TE30 recall other non¬ 
homologous rearrangements that we have found in revertants 
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FIG. 6. Summary of the sequence analysis of TGEV E gain-of-function mutants. Two separate searches for intermediate- or large-plaque 
mutants of the TGEV E substitution recombinant were performed following serial passaging of independent plaques. Mutants that were obtained 
in the first search are denoted by open circles; mutants that were obtained in the second search are denoted by filled circles. All mutants are 
independent except for the following pairs: TE1 and TE26; TE23 and TE30; and TE27 and TE28. The entire E and M genes of all mutants were 
sequenced. The wild-type TGEV E protein is represented linearly at the top; the solid rectangle indicates the predicted transmembrane domain. 
Positions within the E protein at which potential gain-of-function mutations were found are connected by arrows to the wild-type residue, under 
which is indicated changes found in each mutant. All coding changes are shown; there were no noncoding changes. No changes were found in the 
M gene of any of the revertants. Boxed residues, or pairs of residues, are those that were chosen for further analysis. At the bottom are details 
about an insertion and a deletion, respectively, found in mutants TE29 and TE30. 


of M and N protein mutants that had extremely defective 
phenotypes (24, 29). We believe that coronavirus RNA syn¬ 
thesis must produce such rearrangements at a constant low 
rate and that they become visible only when revealed by a 
sufficiently strong selective pressure. 

To determine whether the mutations found in the TGEV E 
gain-of-function mutants were indeed responsible for the en¬ 
hanced growth of the mutants, we reconstructed many TGEV 
E substitution recombinants, each containing one of a subset of 
these mutations (those that are boxed in Fig. 6). This permitted 
us to specifically gauge the effects of individual mutations in 
the absence of possible mutations elsewhere in the genome. 
From the cluster of mutations in the transmembrane region, 
we separately reconstructed TGEV E substitution recombi¬ 
nants with L31V, L32R, and I17V-plus-L32R mutations. In all 


three cases, the reconstructed mutants produced considerably 
larger plaques than did the original TGEV E substitution mu¬ 
tant (Fig. 7), and the efficiency of virus replication was greatly 
enhanced. Likewise, we reconstructed TGEV E substitution 
recombinants with mutations from the carboxy-terminal tail 
cluster: G50A, I55S, I55T, V56G, V56A, P57L, H60Y, and 
I17V plus H60Y. These reconstructed mutants also produced 
plaques that, to various degrees, were significantly larger than 
those of the TGEV E substitution mutant (Fig. 7). Plaques of 
all reconstructed mutants were indistinguishable from those of 
the original isolates in which the mutations had been identi¬ 
fied. In addition, reconstructed mutants exhibited the same 
extents of enhanced growth as did the original gain-of-function 
isolates. Passage 1 stocks begun from single plaques of the 
TGEV E substitution mutant had titers on the order of 10 2 
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FIG. 7. Plaques of a reconstructed subset of 11 gain-of-function mutations, representing 20 of the TE mutants, compared with plaques of the 
wild type and the original TGEV E substitution mutant. Plaque titrations were carried out on L2 cells at 37°C. Monolayers were stained with 
neutral red at 72 h postinfection and were photographed 18 h later. 


PFU per ml. By contrast, passage 1 stocks of either the original 
or the reconstructed TGEV E gain-of-function mutants ranged 
from 10 4 to 10 5 PFU per ml. These results allowed us to 
conclude that the mutations are directly responsible for the 
larger plaque size and enhanced growth phenotypes of the 
gain-of-function mutants. The greatest increases in plaque size 
were observed with the L31V or L32R change in the trans¬ 
membrane domain and with the I55S, I55T, V56G, or P57L 
change in the carboxy-terminal tail of TGEV E. These posi¬ 
tions may thus define the core residues of each cluster, where 
changes can exert the greatest effect. Intermediate effects on 
plaque size were found to be caused by mutations at residues 
more distant from either of these two loci. For two of the core 
residues, 155 and V56, two different mutations each were re¬ 
constructed. TGEV E substitution recombinants with I55S and 
I55T mutations produced equally large plaques, whereas the 
recombinant with V56G produced larger plaques than did the 
recombinant with V56A (Fig. 7). Two reconstructed recombi¬ 
nants with double mutations, I17V plus L32R and I17V plus 
H60Y, were not significantly different from recombinants with 
the respective single mutations, L32R and H60Y. This suggests 
that the 117V mutation did not function synergistically with 
either of the other mutations with which it had been isolated in 
double mutants. We do not yet know the effects, if any, of 
pairing other mutations that were not originally found to¬ 
gether. In sum, our reconstructions confirmed that any one of 


several single mutations in either of two crucial regions of the 
molecule was sufficient to render the TGEV E protein func¬ 
tional in MHV. 

In addition, we examined protein expression by the original 
TGEV E substitution mutant and by four of the reconstructed 
gain-of-function mutants. For this purpose, an antipeptide an¬ 
tiserum was raised against the carboxy-terminal 22 amino acids 
of the TGEV E protein. This antiserum was used in Western 
blots of lysates from cells infected with the TGEV E substitu¬ 
tion mutant or with four of the reconstructed gain-of-function 
mutants, along with lysates from wild-type-infected and mock- 
infected controls (Fig. 8). This analysis revealed equivalent 
expression of TGEV E protein by two independent isolates of 
the original substitution mutant and the reconstructed mutants 
TE9 (L31V), TE11 (L32R), TE27 (I17V plus L32R) and TE28 
(117V plus H60Y). The anti-TGEV E antiserum did not cross- 
react with MHV E protein, and conversely, the anti-MHV E 
antiserum did not cross-react with TGEV E protein. Also, the 
levels of MHV N protein, probed as an internal infection 
control, were comparable among all lysates, indicating that the 
expression of TGEV E protein in the original substitution 
mutant did not impair the expression of another MHV struc¬ 
tural protein. These results demonstrated that the failure of 
the original (wild-type) TGEV E gene to functionally substi¬ 
tute for that of MHV was not due to failure of the original 
substitution mutant to express the TGEVE protein. Therefore, 
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FIG. 8. Western blots of lysates from 17C11 cells that were mock 
infected or were infected with wild-type MHV (Alb240), two indepen¬ 
dent isolates of the TGEV E substitution mutant (Alb515 and Alb516), 
or four reconstructed gain-of-function mutants (TE27, TE28, TE9, or 
TE11). Blots were probed with polyclonal anti-TGEV E antiserum, 
polyclonal anti-MHV E antiserum, or monoclonal anti-N antibody 
J.3.3 (15). 


the gain-of-function mutations acted through qualitative, 
rather than quantitative, effects on the TGEV protein. 

DISCUSSION 

To obtain insights into the role played by E protein in coro- 
navirus assembly, we examined the extent to which heterolo¬ 
gous E proteins could replace that of MHV. As a prerequisite 
to this study, however, we needed to determine whether the 
AE mutant that we had previously isolated (30) truly repre¬ 
sented the E-null phenotype in MHV. This had not been un¬ 
equivocally demonstrated, because the AE mutant also con¬ 
tained complete deletions of genes 4 and 5a. Although prior 
work with a number of strains of MHV had shown that genes 
4 and 5a are nonessential for growth in tissue culture (11, 13, 
40, 49, 53), a A45a mutant had exhibited slightly reduced ki¬ 
netics of growth and had been found to be attenuated in vivo 
(11). Therefore, to clarify the status of the AE mutant, we 
constructed an E-KO mutant in which only expression of the E 
gene was disrupted through the introduction of point muta¬ 
tions (Fig. 1). The E-KO mutant entirely recapitulated the 
severe growth defect of the AE mutant (Fig. 2A), and the 
ablation of E expression in both mutants was ascertained by 
Western blot analysis (Fig. 2B and C). We can thus conclude 
that knockout of the E gene is responsible for the observed 
defect and that the absence of genes 4 and 5 a did not make a 
detectable contribution to the phenotype of AE. The combined 
results establish that E protein, although not essential, is ex¬ 
tremely important for the growth of MHV. 

The exact nature of this importance, nevertheless, is cur¬ 


rently obscure. Some evidence appears to point to a require¬ 
ment for direct physical interactions between E and M proteins 
in virion morphogenesis. In coexpression systems, particular 
interspecies combinations of E and M proteins are not com¬ 
petent for VLP assembly (2, 12). Similarly, the physical prox¬ 
imity of the E and M proteins in IBV has been demonstrated 
by their chemical cross-linking (6) or coimmunoprecipitation 
(33) in infected or cotransfected cells, although these proper¬ 
ties did not always correlate with the capabilities of mutant E 
proteins to form VLPs with M protein (6). Conversely, many 
previous results could be taken to indicate that E protein acts 
at a distance, independently of it having to come into contact 
with M protein. The isolated expression of either IBV or MHV 
E protein leads to the formation and cellular export of mem¬ 
branous vesicles (4, 35). Likewise, the expression of MHV E 
protein has been shown to give rise to intracellular clusters of 
convoluted membrane structures (43) that strongly resemble 
those seen in coronavirus-infected cells (9). These observations 
suggest that the principal role of E protein is to induce mem¬ 
brane curvature in the budding compartment. 

To learn more about the requirements imposed on E protein 
sequence relative to the possible specificity of interactions be¬ 
tween E protein and M protein, we attempted to replace the 
MHV E gene with E genes from a range of phylogenetically 
conserved and divergent coronaviruses covering all three 
groups within the coronavirus family. This was carried out by 
creation of ORF-for-ORF exchanges, by which heterologous E 
proteins could be expressed in the same manner as native 
MHV E protein. The most conservative substitution made was 
that of the E protein of BCoV, the group 2 coronavirus that is 
the closest relative of MHV. The E gene of BCoV was fully 
functional when installed in place of the MHV E gene, and the 
resulting recombinant behaved identically to a wild-type MHV 
control (Fig. 3C). This result was not unexpected, since sub¬ 
stitutions of similarly homologous domains of the BCoV M or 
N protein have been found to be able to replace the counter¬ 
part domains in MHV (10, 42). More striking was our finding 
that the E protein of SARS-CoV, the outlying member of the 
group 2 coronaviruses (21), was almost fully able to replace the 
MHV E protein, notwithstanding the much lower degree of 
sequence identity between these two E proteins. 

Most remarkably, however, we observed that the MHV mu¬ 
tant with an E gene obtained from the group 3 coronavirus 
IBV was indistinguishable from wild-type MHV. The IBV E 
substitution mutant produced plaques identical in size to those 
of the wild type, and it grew efficiently, to titers equivalent to 
those of the wild type. This interchangeability was not hindered 
by the large disparity in primary sequence between the two E 
proteins. Not only does IBV E have the most limited homology 
to its MHV counterpart of any coronavirus (22% amino acid 
sequence identity), but it is also 26 residues longer than the 
83-amino-acid MHV E protein (Fig. 3B). This result, together 
with the substitution of the SARS-CoV E protein, supports the 
notion that E proteins function without having to make se¬ 
quence-specific contacts with other viral components, includ¬ 
ing, most notably, M protein. We did observe, however, that 
IBV E protein was incorporated into virions of the IBV E 
substitution mutant (Fig. 5), just as MHV E protein is incor¬ 
porated into MHV (54). Although this could be taken as evi¬ 
dence for a physical interaction between the E and M proteins, 
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we currently think that it merely reflects that these two pro¬ 
teins are colocalized in infected cells and that M-M interac¬ 
tions during budding do not entirely exclude E protein. 

At present, the precise mechanism by which IBV E is able to 
entirely replace MHV E is undetermined. Most likely, the two 
E proteins provide identical functionality during infection, de¬ 
spite their sequence divergence. It cannot be ruled out, how¬ 
ever, that IBV E offers an alternate, but equally efficient, op¬ 
tion for MHV assembly in the absence of MHV E. Elegant 
expression studies of the IBV E protein have shown that the 
carboxy-terminal tail of IBV E interacts with the IBV M pro¬ 
tein, and it also contains a Golgi-targeting signal (5, 6). Both of 
these characteristics were found to be important, and probably 
sufficient, for VLP formation. Further detailed study will be 
required to delineate how these results are commensurate 
with the mechanism by which IBV E compensates for the 
loss of MHV E in a complete viral infection. An attractive 
approach would be to examine the functionality within 
MHV of hybrid E proteins that mix domains taken from 
IBV E and from MHV E. 

Contrary to results with the substitutions of group 2 and 
group 3 E genes, the E gene from the group 1 coronavirus 
TGEV was inactive in MHV (Fig. 3C), even though the TGEV 
E protein was clearly expressed by the substitution mutant 
(Fig. 8). This finding strongly resembles the outcome of prior 
coexpression work showing that interspecies combinations of 
the M and E proteins of TGEV and BCoV fail to yield VLPs 
(2). Negative results were also obtained in VLP experiments 
that mixed the M and E proteins of FIPV and MHV (12). In 
both cases, as well as with our TGEV E substitution mutant, 
the incompatibilities that were found occurred between com¬ 
ponents from group 1 coronaviruses (TGEV and FIPV) and 
group 2 coronaviruses (MHV and BCoV). At first, these ob¬ 
servations appeared at variance with the idea that E protein 
acts independently of a physical association with M protein. 
However, we were able to isolate numerous gain-of-function 
mutants of the TGEV E substitution recombinant, in which 
single amino acid changes in the TGEV E protein allowed it to 
assist MHV replication. Since the gain-of-function mutant E 
proteins remain extremely divergent in sequence from MHV E 
protein, we believe that such a sequence difference is consis¬ 
tent with a mechanism of E acting at a distance from M. On the 
other hand, the fact that the TGEV E protein required mod¬ 
ification before it could replace its MHV counterpart suggests 
that some fundamental disparity exists in the details of how the 
TGEV and MHV E proteins operate during infections by their 
respective viruses. This disparity most likely resides at the level 
of interactions among viral components rather than in the 
ability of the TGEV E protein to function in mouse cells. All 
available evidence indicates that once the barrier of S protein- 
receptor recognition is overcome, there is no significant block 
to coronavirus replication in different species of cells. In par¬ 
ticular, the interspecies chimera mFIPV (12, 38), which is an 
FIPV recombinant containing the ectodomain of the MHV S 
protein, grows well in mouse cells. This indicates that the 
group 1 FIPV E protein (which has 94% sequence identity with 
the TGEV E protein) is fully functional in mouse cells in 
conjunction with its cognate viral structural proteins. 

All of the gain-of-function mutants that we selected in two 
separate searches mapped in the (TGEV) E gene (Fig. 6), and 


reconstruction of many of these mutants proved that the re¬ 
sponsible mutations had been correctly identified (Fig. 7). No 
changes were found in the (MHV) M genes of any of the 
gain-of-function mutants, supporting the idea that it was not 
possible to obtain modifications of M protein able to compen¬ 
sate for the deficiency of the wild-type TGEV E protein in 
MHV. In accord with this result, we previously found that 
revertants of clustered charged-to-alanine mutants of the 
MHV E protein mapped solely in the E gene and not in the M 
gene (14). Similarly, we have been able to isolate many inter- 
genic second-site revertants for extremely defective mutants of 
the M (29) and N (24) proteins of MHV, but none of these 
revertants mapped in the E gene. 

Almost all of the TGEV E gain-of-function mutations were 
clustered at two loci, one in the transmembrane domain (with 
the strongest effects resulting from changes in residues L31 and 
L32) and the other in the carboxy-terminal tail (with the stron¬ 
gest effects at residues 155, V56, and P57). The mutations that 
occurred at each of these positions actually reduced, rather 
than increased, the homology between TGEV E and MHV E. 
The nature of the mutations in the transmembrane region is 
particularly intriguing. Most of these, such as L25V, I27V, 
I27L, and L31V, are fairly conservative substitutions, perhaps 
indicative of subtle modifications in the interactions among 
hydrophobic residues in oligomers of E (46). By contrast, the 
L32R mutation, which arose independently in three distinct 
mutants (TE11, TE26, and TE27), might be expected to sig¬ 
nificantly alter the topology or orientation of the transmem¬ 
brane domain. 

The distribution of the TGEV E gain-of-function mutations 
into two clusters appears to highlight two important domains 
of the molecule, but it is yet indeterminate whether these 
domains govern separate functions or whether they interact 
with one another. Analyses of the IBV E protein have similarly 
indicated two separable functional domains for that molecule 
and have suggested that the transmembrane domain is not 
specifically required for either VLP formation or Golgi local¬ 
ization (5,6). Multiple roles of E protein in viral infection may 
also be implied by the lack of correspondence between the 
lethality of the E-KO mutants of TGEV (8, 41) and the via¬ 
bility of the AE and E-KO mutants of MHV (30; this work). It 
is conceivable that one function of E protein is to potentiate 
the kinetics of virion assembly, but this role is not essential. A 
hypothetical second function may be essential for TGEV, but 
either it is not required by MHV or else it is supplied redun¬ 
dantly by some other MHV protein. 

It was recently demonstrated that the E protein of SARS- 
CoV can form membrane channels having a selectivity for 
monovalent cations (51). Moreover, a peptide corresponding 
to the amino-terminal 40 amino acids of SARS-CoV E (com¬ 
prising the hydrophilic amino terminus and the transmem¬ 
brane domain) had the same properties as did full-length E. 
This result accords with reports that the expression of MHV or 
SARS-CoV E protein enhanced the permeability of bacterial 
or mammalian cells (34, 31); such a capability has led some to 
classify E among a set of small viral proteins termed viroporins 
(20). Very recently, the channel-forming ability of E protein 
was shown to extend to the E proteins of human coronavirus 
229E, MHV, and IBV (50). This property therefore general¬ 
izes to all three coronavirus groups. Interestingly, channels 
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formed by E proteins of group 2 (MHV and SARS-CoV) and 
group 3 (IBV) coronaviruses exhibited a selectivity for sodium 
ions over potassium ions (50, 51). By contrast, the E protein of 
human coronavirus 229E, which, like TGEV, is a member of 
group 1, formed channels that were selective for potassium 
ions over sodium ions. This observation may explain the inac¬ 
tivity of the wild-type TGEV E protein in MHV infection, and 
it may further provide a basis for understanding the nature of 
the transmembrane gain-of-function mutations. 

Our work supports the conclusion that the E proteins of 
coronaviruses possess certain common functions that can be 
provided by a broad, and largely interchangeable, range of 
primary amino acid sequences. Our findings are most consis¬ 
tent with a scenario in which sequence-specific interactions of 
E protein with M protein are not required. We hope to use 
such a framework to obtain further insights into the in vivo 
functions of E protein and the mechanism of this protein’s 
involvement in virion morphogenesis. 
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